"Nguyen" meaning in All languages combined

See Nguyen on Wiktionary

Proper name [Czech]

Etymology: Borrowed from Vietnamese Nguyễn. Etymology templates: {{bor+|cs|vi|Nguyễn}} Borrowed from Vietnamese Nguyễn Head templates: {{cs-proper noun|m-an}} Nguyen m anim
  1. a surname from Vietnamese Tags: animate, masculine
    Sense id: en-Nguyen-cs-name-OGYYaKod Categories (other): Czech entries with incorrect language header, Czech surnames, Pages with 3 entries

Proper name [English]

IPA: /wɪn/, /n(ə)ˈwɪn/, /ˈnjʉwən/ [General-Australian], /wɛn/ [General-Australian], /nuˈjɛn/ [US], /n(ə)-/ [US], /wɛn/ [US], /ŋwɪn/ [uncommon], /n(ə)ˈɡujən/ [uncommon], /ɲuˈɛn/ [uncommon] Forms: Nguyens [plural]
Rhymes: -ɪn, -ʉwən, -ɛn, -ujən Etymology: Borrowed from Vietnamese Nguyễn. Etymology templates: {{bor|en|vi|Nguyễn}} Vietnamese Nguyễn Head templates: {{en-proper noun|Nguyens}} Nguyen (plural Nguyens)
  1. A Vietnamese surname. Translations (Vietnamese surname): Нгуен (Nguen) (Bulgarian), (jyun²) (Chinese Cantonese), (Ruǎn) (Chinese Mandarin), Nguyen (Czech), Nguyên (Czech), Nguyen (French), Nguyên (French), Nguyen [feminine, masculine] (German), נגויין (Hebrew), グエン (Guen) (Japanese), ង្វៀន (ngviən) (Khmer), 응우옌 (eung'uyen) (Korean), 응웬 (eung'wen) (Korean), Nguyē̆nus [New-Latin, masculine] (Latin), Nguyē̆na [New-Latin, feminine] (Latin), Нгуен (Ngujen) (Russian), เหงวียน (ngwǐian) (Thai), เหงียน (ngǐian) (Thai), Nguyễn (Vietnamese)

Proper name [French]

IPA: /ɛn.ɡɥi.jɛn/, /nɥi.jɛn/, /ɡɥi.jɛn/ Audio: Fr-Nguyen.oga , Fr-Normandie-Nguyen.ogg
Etymology: Borrowed from Vietnamese Nguyễn. Etymology templates: {{bor+|fr|vi|Nguyễn}} Borrowed from Vietnamese Nguyễn Head templates: {{fr-proper noun|g=mf}} Nguyen m or f
  1. a surname from Vietnamese Wikipedia link: fr:Nguyen Tags: feminine, masculine Synonyms: Nguyên
    Sense id: en-Nguyen-fr-name-OGYYaKod Categories (other): French entries with incorrect language header, French surnames, Pages with 3 entries

Inflected forms

Alternative forms

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Vietnamese Nguyễn",
      "name": "bor"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "forms": [
    {
      "form": "Nguyens",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "Nguyens"
      },
      "expansion": "Nguyen (plural Nguyens)",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English surnames",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Entries with translation boxes",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 3 entries",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Bulgarian translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Cantonese translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Czech translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with French translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with German translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Hebrew translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Japanese translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Khmer translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Korean translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Latin translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Mandarin translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Russian translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Thai translations",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Terms with Vietnamese translations",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "A Vietnamese surname."
      ],
      "id": "en-Nguyen-en-name-UEuNtsEU",
      "links": [
        [
          "surname",
          "surname"
        ]
      ],
      "translations": [
        {
          "code": "bg",
          "lang": "Bulgarian",
          "roman": "Nguen",
          "sense": "Vietnamese surname",
          "word": "Нгуен"
        },
        {
          "code": "yue",
          "lang": "Chinese Cantonese",
          "roman": "jyun²",
          "sense": "Vietnamese surname",
          "word": "阮"
        },
        {
          "code": "cmn",
          "lang": "Chinese Mandarin",
          "roman": "Ruǎn",
          "sense": "Vietnamese surname",
          "word": "阮"
        },
        {
          "code": "cs",
          "lang": "Czech",
          "sense": "Vietnamese surname",
          "word": "Nguyen"
        },
        {
          "code": "cs",
          "lang": "Czech",
          "sense": "Vietnamese surname",
          "word": "Nguyên"
        },
        {
          "code": "fr",
          "lang": "French",
          "sense": "Vietnamese surname",
          "word": "Nguyen"
        },
        {
          "code": "fr",
          "lang": "French",
          "sense": "Vietnamese surname",
          "word": "Nguyên"
        },
        {
          "code": "de",
          "lang": "German",
          "sense": "Vietnamese surname",
          "tags": [
            "feminine",
            "masculine"
          ],
          "word": "Nguyen"
        },
        {
          "code": "he",
          "lang": "Hebrew",
          "sense": "Vietnamese surname",
          "word": "נגויין"
        },
        {
          "code": "ja",
          "lang": "Japanese",
          "roman": "Guen",
          "sense": "Vietnamese surname",
          "word": "グエン"
        },
        {
          "code": "km",
          "lang": "Khmer",
          "roman": "ngviən",
          "sense": "Vietnamese surname",
          "word": "ង្វៀន"
        },
        {
          "code": "ko",
          "lang": "Korean",
          "roman": "eung'uyen",
          "sense": "Vietnamese surname",
          "word": "응우옌"
        },
        {
          "code": "ko",
          "lang": "Korean",
          "roman": "eung'wen",
          "sense": "Vietnamese surname",
          "word": "응웬"
        },
        {
          "code": "la",
          "lang": "Latin",
          "sense": "Vietnamese surname",
          "tags": [
            "New-Latin",
            "masculine"
          ],
          "word": "Nguyē̆nus"
        },
        {
          "code": "la",
          "lang": "Latin",
          "sense": "Vietnamese surname",
          "tags": [
            "New-Latin",
            "feminine"
          ],
          "word": "Nguyē̆na"
        },
        {
          "code": "ru",
          "lang": "Russian",
          "roman": "Ngujen",
          "sense": "Vietnamese surname",
          "word": "Нгуен"
        },
        {
          "code": "th",
          "lang": "Thai",
          "roman": "ngwǐian",
          "sense": "Vietnamese surname",
          "word": "เหงวียน"
        },
        {
          "code": "th",
          "lang": "Thai",
          "roman": "ngǐian",
          "sense": "Vietnamese surname",
          "word": "เหงียน"
        },
        {
          "code": "vi",
          "lang": "Vietnamese",
          "sense": "Vietnamese surname",
          "word": "Nguyễn"
        }
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/wɪn/"
    },
    {
      "ipa": "/n(ə)ˈwɪn/"
    },
    {
      "ipa": "/ˈnjʉwən/",
      "tags": [
        "General-Australian"
      ]
    },
    {
      "ipa": "/wɛn/",
      "tags": [
        "General-Australian"
      ]
    },
    {
      "ipa": "/nuˈjɛn/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/n(ə)-/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/wɛn/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/ŋwɪn/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "ipa": "/n(ə)ˈɡujən/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "ipa": "/ɲuˈɛn/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "rhymes": "-ɪn"
    },
    {
      "rhymes": "-ʉwən"
    },
    {
      "rhymes": "-ɛn"
    },
    {
      "rhymes": "-ujən"
    },
    {
      "homophone": "win"
    },
    {
      "homophone": "wynn"
    },
    {
      "homophone": "winne"
    }
  ],
  "word": "Nguyen"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "cs",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Borrowed from Vietnamese Nguyễn",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "head_templates": [
    {
      "args": {
        "1": "m-an"
      },
      "expansion": "Nguyen m anim",
      "name": "cs-proper noun"
    }
  ],
  "lang": "Czech",
  "lang_code": "cs",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "Czech entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Czech surnames",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 3 entries",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "a surname from Vietnamese"
      ],
      "id": "en-Nguyen-cs-name-OGYYaKod",
      "links": [
        [
          "surname",
          "surname"
        ]
      ],
      "tags": [
        "animate",
        "masculine"
      ]
    }
  ],
  "word": "Nguyen"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "fr",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Borrowed from Vietnamese Nguyễn",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "head_templates": [
    {
      "args": {
        "g": "mf"
      },
      "expansion": "Nguyen m or f",
      "name": "fr-proper noun"
    }
  ],
  "lang": "French",
  "lang_code": "fr",
  "pos": "name",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "French entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "French surnames",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 3 entries",
          "parents": [],
          "source": "w"
        }
      ],
      "glosses": [
        "a surname from Vietnamese"
      ],
      "id": "en-Nguyen-fr-name-OGYYaKod",
      "links": [
        [
          "surname",
          "surname"
        ]
      ],
      "synonyms": [
        {
          "word": "Nguyên"
        }
      ],
      "tags": [
        "feminine",
        "masculine"
      ],
      "wikipedia": [
        "fr:Nguyen"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ɛn.ɡɥi.jɛn/"
    },
    {
      "ipa": "/nɥi.jɛn/"
    },
    {
      "ipa": "/ɡɥi.jɛn/"
    },
    {
      "audio": "Fr-Nguyen.oga",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/3/3c/Fr-Nguyen.oga/Fr-Nguyen.oga.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/3/3c/Fr-Nguyen.oga",
      "text": "Paris"
    },
    {
      "audio": "Fr-Normandie-Nguyen.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/98/Fr-Normandie-Nguyen.ogg/Fr-Normandie-Nguyen.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/9/98/Fr-Normandie-Nguyen.ogg",
      "text": "Normandy"
    }
  ],
  "word": "Nguyen"
}
{
  "etymology_templates": [
    {
      "args": {
        "1": "cs",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Borrowed from Vietnamese Nguyễn",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "head_templates": [
    {
      "args": {
        "1": "m-an"
      },
      "expansion": "Nguyen m anim",
      "name": "cs-proper noun"
    }
  ],
  "lang": "Czech",
  "lang_code": "cs",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "Czech animate nouns",
        "Czech entries with incorrect language header",
        "Czech lemmas",
        "Czech masculine nouns",
        "Czech proper nouns",
        "Czech surnames",
        "Czech surnames from Vietnamese",
        "Czech terms borrowed from Vietnamese",
        "Czech terms derived from Vietnamese",
        "Pages with 3 entries"
      ],
      "glosses": [
        "a surname from Vietnamese"
      ],
      "links": [
        [
          "surname",
          "surname"
        ]
      ],
      "tags": [
        "animate",
        "masculine"
      ]
    }
  ],
  "word": "Nguyen"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Vietnamese Nguyễn",
      "name": "bor"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "forms": [
    {
      "form": "Nguyens",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "Nguyens"
      },
      "expansion": "Nguyen (plural Nguyens)",
      "name": "en-proper noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "English entries with incorrect language header",
        "English lemmas",
        "English proper nouns",
        "English surnames",
        "English terms borrowed from Vietnamese",
        "English terms derived from Vietnamese",
        "English terms with homophones",
        "English uncountable nouns",
        "Entries with translation boxes",
        "Pages with 3 entries",
        "Rhymes:English/ujən",
        "Rhymes:English/ujən/3 syllables",
        "Rhymes:English/ɛn",
        "Rhymes:English/ɛn/2 syllables",
        "Rhymes:English/ɪn",
        "Rhymes:English/ɪn/1 syllable",
        "Rhymes:English/ɪn/2 syllables",
        "Rhymes:English/ʉwən",
        "Rhymes:English/ʉwən/2 syllables",
        "Terms with Bulgarian translations",
        "Terms with Cantonese translations",
        "Terms with Czech translations",
        "Terms with French translations",
        "Terms with German translations",
        "Terms with Hebrew translations",
        "Terms with Japanese translations",
        "Terms with Khmer translations",
        "Terms with Korean translations",
        "Terms with Latin translations",
        "Terms with Mandarin translations",
        "Terms with Russian translations",
        "Terms with Thai translations",
        "Terms with Vietnamese translations"
      ],
      "glosses": [
        "A Vietnamese surname."
      ],
      "links": [
        [
          "surname",
          "surname"
        ]
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/wɪn/"
    },
    {
      "ipa": "/n(ə)ˈwɪn/"
    },
    {
      "ipa": "/ˈnjʉwən/",
      "tags": [
        "General-Australian"
      ]
    },
    {
      "ipa": "/wɛn/",
      "tags": [
        "General-Australian"
      ]
    },
    {
      "ipa": "/nuˈjɛn/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/n(ə)-/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/wɛn/",
      "tags": [
        "US"
      ]
    },
    {
      "ipa": "/ŋwɪn/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "ipa": "/n(ə)ˈɡujən/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "ipa": "/ɲuˈɛn/",
      "tags": [
        "uncommon"
      ]
    },
    {
      "rhymes": "-ɪn"
    },
    {
      "rhymes": "-ʉwən"
    },
    {
      "rhymes": "-ɛn"
    },
    {
      "rhymes": "-ujən"
    },
    {
      "homophone": "win"
    },
    {
      "homophone": "wynn"
    },
    {
      "homophone": "winne"
    }
  ],
  "translations": [
    {
      "code": "bg",
      "lang": "Bulgarian",
      "roman": "Nguen",
      "sense": "Vietnamese surname",
      "word": "Нгуен"
    },
    {
      "code": "yue",
      "lang": "Chinese Cantonese",
      "roman": "jyun²",
      "sense": "Vietnamese surname",
      "word": "阮"
    },
    {
      "code": "cmn",
      "lang": "Chinese Mandarin",
      "roman": "Ruǎn",
      "sense": "Vietnamese surname",
      "word": "阮"
    },
    {
      "code": "cs",
      "lang": "Czech",
      "sense": "Vietnamese surname",
      "word": "Nguyen"
    },
    {
      "code": "cs",
      "lang": "Czech",
      "sense": "Vietnamese surname",
      "word": "Nguyên"
    },
    {
      "code": "fr",
      "lang": "French",
      "sense": "Vietnamese surname",
      "word": "Nguyen"
    },
    {
      "code": "fr",
      "lang": "French",
      "sense": "Vietnamese surname",
      "word": "Nguyên"
    },
    {
      "code": "de",
      "lang": "German",
      "sense": "Vietnamese surname",
      "tags": [
        "feminine",
        "masculine"
      ],
      "word": "Nguyen"
    },
    {
      "code": "he",
      "lang": "Hebrew",
      "sense": "Vietnamese surname",
      "word": "נגויין"
    },
    {
      "code": "ja",
      "lang": "Japanese",
      "roman": "Guen",
      "sense": "Vietnamese surname",
      "word": "グエン"
    },
    {
      "code": "km",
      "lang": "Khmer",
      "roman": "ngviən",
      "sense": "Vietnamese surname",
      "word": "ង្វៀន"
    },
    {
      "code": "ko",
      "lang": "Korean",
      "roman": "eung'uyen",
      "sense": "Vietnamese surname",
      "word": "응우옌"
    },
    {
      "code": "ko",
      "lang": "Korean",
      "roman": "eung'wen",
      "sense": "Vietnamese surname",
      "word": "응웬"
    },
    {
      "code": "la",
      "lang": "Latin",
      "sense": "Vietnamese surname",
      "tags": [
        "New-Latin",
        "masculine"
      ],
      "word": "Nguyē̆nus"
    },
    {
      "code": "la",
      "lang": "Latin",
      "sense": "Vietnamese surname",
      "tags": [
        "New-Latin",
        "feminine"
      ],
      "word": "Nguyē̆na"
    },
    {
      "code": "ru",
      "lang": "Russian",
      "roman": "Ngujen",
      "sense": "Vietnamese surname",
      "word": "Нгуен"
    },
    {
      "code": "th",
      "lang": "Thai",
      "roman": "ngwǐian",
      "sense": "Vietnamese surname",
      "word": "เหงวียน"
    },
    {
      "code": "th",
      "lang": "Thai",
      "roman": "ngǐian",
      "sense": "Vietnamese surname",
      "word": "เหงียน"
    },
    {
      "code": "vi",
      "lang": "Vietnamese",
      "sense": "Vietnamese surname",
      "word": "Nguyễn"
    }
  ],
  "word": "Nguyen"
}

{
  "etymology_templates": [
    {
      "args": {
        "1": "fr",
        "2": "vi",
        "3": "Nguyễn"
      },
      "expansion": "Borrowed from Vietnamese Nguyễn",
      "name": "bor+"
    }
  ],
  "etymology_text": "Borrowed from Vietnamese Nguyễn.",
  "head_templates": [
    {
      "args": {
        "g": "mf"
      },
      "expansion": "Nguyen m or f",
      "name": "fr-proper noun"
    }
  ],
  "lang": "French",
  "lang_code": "fr",
  "pos": "name",
  "senses": [
    {
      "categories": [
        "French 2-syllable words",
        "French 3-syllable words",
        "French entries with incorrect language header",
        "French feminine nouns",
        "French lemmas",
        "French masculine nouns",
        "French nouns with multiple genders",
        "French proper nouns",
        "French surnames",
        "French surnames from Vietnamese",
        "French terms borrowed from Vietnamese",
        "French terms derived from Vietnamese",
        "French terms with IPA pronunciation",
        "Pages with 3 entries"
      ],
      "glosses": [
        "a surname from Vietnamese"
      ],
      "links": [
        [
          "surname",
          "surname"
        ]
      ],
      "tags": [
        "feminine",
        "masculine"
      ],
      "wikipedia": [
        "fr:Nguyen"
      ]
    }
  ],
  "sounds": [
    {
      "ipa": "/ɛn.ɡɥi.jɛn/"
    },
    {
      "ipa": "/nɥi.jɛn/"
    },
    {
      "ipa": "/ɡɥi.jɛn/"
    },
    {
      "audio": "Fr-Nguyen.oga",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/3/3c/Fr-Nguyen.oga/Fr-Nguyen.oga.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/3/3c/Fr-Nguyen.oga",
      "text": "Paris"
    },
    {
      "audio": "Fr-Normandie-Nguyen.ogg",
      "mp3_url": "https://upload.wikimedia.org/wikipedia/commons/transcoded/9/98/Fr-Normandie-Nguyen.ogg/Fr-Normandie-Nguyen.ogg.mp3",
      "ogg_url": "https://upload.wikimedia.org/wikipedia/commons/9/98/Fr-Normandie-Nguyen.ogg",
      "text": "Normandy"
    }
  ],
  "synonyms": [
    {
      "word": "Nguyên"
    }
  ],
  "word": "Nguyen"
}

Download raw JSONL data for Nguyen meaning in All languages combined (6.4kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2024-09-22 from the enwiktionary dump dated 2024-09-20 using wiktextract (af5c55c and 66545a6). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.